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ABSTRACT 

Large planned photometric surveys will discover hundreds of thousands of supernovae (SNe), out- 
stripping the resources available for spectroscopic follow-up and necessitating the development of 
purely photometric methods to exploit these events for cosmological study. We present a light curve 
fitting technique for type la supernova (SN la) photometric redshift (photo-z) estimation in which the 
rcdshift is determined simultaneously with the other fit parameters. We implement this "lcfit+z" 
technique within the frameworks of the MLCS2k2 and SALTii light curve fit methods and determine 
the precision on the redshift and distance modulus. This method is applied to a spectroscopically 
confirmed sample of 296 SNe la from the Sloan Digital Sky Survey-II (SDSS-II) SN Survey and 
37 publicly available SNe la from the Supernova Legacy Survey (SNLS). We have also applied the 
method to a large suite of realistic simulated light curves for existing and planned surveys, including 
the SDSS, SNLS, and Large Synoptic Survey Telescope (LSST). When intrinsic SN color fluctuations 
are included, the photo-z precision for the simulation is consistent with that in the data. Finally, we 
compare the lcfit+Z photo-z precision with previous results using color-based SN photo-z estimates. 

Subject headings: supernova light curve fitting 



1. INTRODUCTION 

To investigate the expansion history of the universe, 
increasingly large samples of high-quality type la su- 
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pernova (SN la) light curves are being used to mea- 
sure luminosity distances as a function of redshift (the 
SN la Hubble diagram). Expected SN la samples 
will be in the thousands fo r the Dark Energy Sur- 
vey (DES: iBernstein etail (|2009h ) and in the hun- 
dreds of thousands for the surveys to be carried out 
by the Panoramic Survey Telescope and Rapid Re- 
sponse System (Pan-STARRS£3 and by the Larg e 
Synoptic Survey Telescope (LSST: llvezic et all (|2008li : 
ILSST Science Collaborations! (j2009ft ). For the latter two, 
only a small fraction of the SNe will have spectroscopi- 
cally determined redshifts from SN or host-galaxy spec- 
tra for the foreseeable future. To make use of these large 
SN samples, the redshifts will have to be determined pho- 
tometrically using both the SN light curves and the host- 
galaxy photometric observables. 

Methods for estimating galaxy photo-z 's have been 
developed over man y years (for a review, see, e.g., 
lAbdalla et"aT1 (<2008l )V They generally fall into two cat- 
egories: (1) an empirical approach, in which one trans- 
lates observed colors, magnitudes, or other photometric 
observables into a redshift estimate, training the algo- 
rithm on a subset of galaxies with spectroscopic redshifts; 
and (2) template fitting, in which observed colors are 
matched to redshifted template galaxy spectral energy 
distributions until the best match for galaxy redshift and 
type is obtained. The development of photo-z methods 
using SN data is more recent. Some have followed and 
adapted the empirical approach to galaxy photo-z esti- 
mation, e.g., using observed SN colors ne ar the epoch 
of peak brightnes s to estimate the redshift (| Wand [20071 : 
iWang et al.ll2007l ) or early- epoch colors to select SN e in 
particular redshift regions (|Dahlen fc Goobarl 12001) . 

In this paper, we present and describe a method of 

18 http : //pan-starrs . if a. hawaii . edu/public 
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SN la photo-z estimation that is analogous to the tem- 
plate fitting method for galaxy photo-z 's. In models 
used to fit SN la light curves, one typically uses a spec- 
troscopically determined rcdshift, and the fit parame- 
ters are usually taken to be the epoch of peak bright- 
ness, the light curve shape or stretch, a color or dust 
extinction estimate, and the distance modulus. In our 
approach, we extend the usual methods of fitting light 
curves to include the redshift as a fifth fit parame- 
ter. We apply this "lcfit+z" method to determine 
photo-z's for the spectroscopically confirm ed SDSS-II 
Frieman et all 1200a iKessler et all l2009a| ) and SNLS 



Astier et al.l (120061 ): hereafter A06) SNe, and compare 



the resulting photo-z precision to that obtained from 
simulated samples. Once we have verified the reliabil- 
ity of the simulations by comparison with the data, we 
apply the lcfit+z method to simulated LSS T SN ob- 
servations. In all cases, we use both ml cs2k2 (Dha et al.1 
[2007HKessler et al.|[2009i) and saltii (|Guv et al.ll2007[) 
light curve fitting models. 

Variants of the light curve fit approach t o SN photo-z's 
have been used bef ore, by both the S NLS (jSullivan et al.l 
120061 ) and SDSS (|Sako et al.l 120081 ) surveys, to select 
SN la candidates for spectr oscopic follow-up afte r only 
a few photometric epochs. iKim fc Miquell (120071 ) used 
the saltii model and a Fisher matrix analysis to es- 
timate uncertainties on photometric redshifts and dis- 
tances. Using a t e chniqu e similar to our lcfit+z 
method, iGong et al.l (|2010f ) studied LSST simulations, 
focusing mainly on contamination from non-la SNe and 
the resulting precision on cosmological parameters. In 
contrast, we focus here on the precision and bias for the 
photo-z and distance modulus. We also use more real- 
istic simulations based on the LSST cadence for both 
the deep and wide surveys and illustrate some differ- 
ences between these two components of the LSST sur- 
vey. Our estimates of non-la contamination and cos- 
mological precision will be presented i n a fu ture work. 
Recently, FPalanquc -Delabrouille et al . (2009) (hereafter 
PD09) employed the light curve fit photo-z technique 
within the saltii framework. For nearly 300 SNe la 
from the SNLS, they evaluated both the photo-z pre- 
cision and the fraction of catastrophic redshift outliers. 
The PD09 SN sample is by far the largest to date used 
to study SN photo-z methods; our SDSS-II sample, at 
lower redshifts, is of comparable size. 

The plan of this paper is as follows. We introduce the 
spectroscopically confirmed SN data samples (SDSS-II 
and SNLS) in SJ2J In Jll wc describe the simulation, and 
we present the lcfit+z method in detail in The 
photo-z precision and fit-parameter correlations for the 
data samples, along with the corresponding results from 
simulated samples, are presented in $5] We use the sim- 
ulation to make predictions for the LSST survey in |J6J 
In iJT] we make direct comparisons with the color-based 
photo-z method p resented in pr e vious w orks. 

As described in IKessler et al.l (|2009bO . all light curve 
fitting and simulation software is publicly available in the 
SNANA package^ 

2. THE SDSS-II AND SNLS DATA SAMPLES 



http : //www . sdss . org/supernova/SNANA . html 



To test the lcfit+z method, we use the full three- 
season sample from t he SDSS-II Supernova (SN) Survey 
(|Frieman et al.l 120081 ). and the publicly available sample 
from the first season of the SNLS (A06). Below we give 
a brief description of these samples. 
The SDSS-II S N Survey used the SDSS camera 
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on the SDSS 2.5 m telescope 
at Apache Point Observatory to 
search for SNe in the Fall seasons (September 1 through 
November 30) of 2005-2007. This survey scanned a re- 
gion (designated stripe 82) centered on the celestial equa- 
tor in the Southern Galactic hemisphere that is 2.5° wide 
and runs between right ascensions of 20 hr and 4 hr , cov- 
ering a total area of 300 deg 2 with a typical cadence of 
every four nights per region . Images were obtai ned in 
five broad passbands, ugriz (jFukugita et al.lll996f ). with 
55 s exposures an d processed through the PHOTO pho- 
tometric pipeline (|Lupton et al.ll200ll ). Within 24 hours 
of collecting the data, the images were searched for SN 
candidates that were selected for spectroscopic follow-up 
observations in a program involving about a dozen tele- 
scopes. The SDSS-II SN Survey discovered and spectro- 
scopically confirmed a total of ~ 500 SNe la. A larger 
sample of photometrically identified but spectroscopi- 
cally unobserved SNe la was also compiled, and host- 
galaxy redshifts for several hundred of these photomet- 
ric candida tes have been obtain ed to date. The SDSS- 
III Survey (jSchlegel et al. 2009), as a small part of its 
early program, is in the process of measuring host-galaxy 
redshifts for more than 1000 of these photometrically 
identified SNe la. The telescope aper ture, focal plane , 
and exposure time of the SDSS system ([York et al.ll2000f ) 
were ideal for discovering SNe in the previously under- 
explored redshift range 0.1 < z < 0.3. Details of th e 
SDSS-II SN Surv ey are given in iFrieman etaLI (|2008D ; 
iSako et all (|2008l ). the procedures for spectroscopic iden- 
tification and reds hift determinations are described in 
iZheng et al.1 (12008 ) , and the SN photometry is described 
in lHoltzman et all (|2008| ) . A condensed summary of the 
SDSS-II survey, SN typing, redshift deter mination, pho- 
tometry , and calibration can be found in IKessler et al.l 
(I2009al) . 

The SNLS was a five-year survey covering 4 deg 2 us- 
ing the MegaCam imager on the 3.6 m Canada-France- 
Hawaii Telescope (CFHT). Images were taken in four 
bands similar to those used by the SDSS: gM, Tm,Im, zm, 
where the subscript M denotes the MegaCam system. 
The SNLS exposures were ~ 1 hr in order to discover 
SNe at redshifts up to z ~ 1. The SNLS images were 
processed in a fashion similar to the SDSS-II so that 
spectroscopic observations could be used to confirm the 
identities and determine the redshifts of the SN candi- 
dates. We use the publicly available sample from their 
first year of operations that ended July 15, 2004. De- 
tailed information about the SNLS can be found in A06 
and references therein. 

Since high-quality light curves are needed for the 
LCFIT+Z method, we apply the following selection re- 
quirements to the photometric data for inclusion in our 
analysis samples: (1) spectroscopic confirmation of type 
la, (2) a measurement with T rcst < —3 days, (3) a mea- 
surement with T rest > +10 days, (4) measurements in 
at least three observer-frame filters have signal-to-noise 
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ratio (S/N) greater than 8, and (5) the probability cor- 
responding to the fit-x 2 /A r dof (® is Py? > 0.02. Here 
T r ost is the epoch in the SN rest frame relative to peak 
brightness in the B band, and we note that it depends 
on the fitted photo-z value. For the SDSS-II, we use 
only the gri passbands. The number of SNe la satisfying 
these selection requirements is nearly 300 and 40 for the 
SDSS-II and SNLS, respectively; the exact numbers of 
SNe depend on the fitting model (mlcs2k2 or SALTii) 
and will be given in Sj5] The selection criteria above are 
not based on optimizations, but are instead motivated 
by the strong correlation between redshift and color. Re- 
quiring two colors and at least one measurement in each 
of three passbands with S/N > 8 explicitly ensures good 
color measurements. The Xi- os t requirements (< —3 and 
> +10 days) ensure a good determination of the time of 
peak brightness (to); since SNe become redder after peak 
light, a mismeasurement of to translates directly into a 
mismcasurement of color, and hence redshift. 

Here we provide some additional motivation for the 
above requirements. Relaxing the S/N requirement (no. 
4 above) to S/N > 5 results in about 10% more SNe la 
in the SDSS-II sample, and a 20% degradation in the 
photo-z precision. Making a more restrictive cut of 
S/N > 10 results in a 20% loss of events and a negli- 
gible improvement in the precision. We have therefore 
chosen S/N > 8 as a reasonable compromise between 
sample statistics and precision. To motivate the sam- 
pling requirements, we have applied the lcfit+z fitting 
method (Sj4|) to the SDSS-II sample in which all measure- 
ments prior to peak brightness have been rejected; the 
resulting precision and bias on the fitted to and photo-z 
are significantly degraded. The sampling requirements 
above (nos. 2 and 3) are therefore designed to ensure a 
good determination of to- 

3. SIMULATIONS 

We use the SNANA simulation code to generate realis- 
tic SN la light curves that can be analyzed in exactly 
the same manner as the data. The simulation is used to 
compare with the data, to compare with previous photo- 
z studies based on simulations, and to make predictions 
for LSST. All SN simulations are based on a standard 
ACDM cosmology (w = -1, ft M = 0.3, A = 0.7) and 
they are generated and fit using the same light curve 
model: mlcs2k2 or SALTii. This strategy explicitly as- 
sumes that the light curve model is correct, and will 
therefore yield the most optimistic results. As discussed 
below, the models arc adjusted to account for the anoma- 
lous Hubble scatter. However, these models have not 
been adjusted to account for the discre pancies in the ul- 
traviolet region (|Kessler et all (|2009a| ): h ereafter K09) 
Details of the simulation are described in iKessler et al.1 
(I2009bfl and in §6 of K09; here we give a brief overview. 

For MLCS2k2, rest-frame model magnitudes are gen- 
erated from light curve templates and then dimmed 
by host-galaxy dust extinction Usin g SN la spec- 
tral templates from iHsiao et"al~l ()2007l ). if-corrections 
(jNugent et al.ll2002l ) are used to transform the rest-frame 
(U BVRI) model magnitudes into observer-frame magni- 
tudes. For SALTii, the model is based on a time-sequence 
of rest-frame spectra, and observer-frame magnitudes 
are computed by convolution with the appropriate filter- 
response curves. 



Since light curve models are defined over a specific 
wavelength range in the resjt frame, one usually checks 
the rest-frame wavelength Af/(1 + z), where Af is the 
mean wavelength of the observer-frame filter and z is 
the redshift. If the rest-frame wavelength is outside the 
valid range of the model, then the corresponding fil- 
ter is typically ignored. For photo-z applications, this 
method of ignoring filters clearly cannot be used, since 
the redshift is not known ahead of time. To be realis- 
tic, we should not use this filter-ignoring procedure in 
the simulations either. Therefore, our simulations use 
wavelength-extended models that generate fluxes for fil- 
ters with Af/(1 + z) beyond the nominally defined wave- 
length range. For both mlcs2k2 and SALTII, the rest- 
frame wavelengths are extended down to 2500 A. For 
SALTII, the 7000 A upper limit has been raised to 8700 A. 
We note that the simulation and fitter use the same 
wavelength-extended model, and therefore these tests do 
not probe potential problems if the extended part of the 
model is wrong. 

We have implemented two models of intrinsic SN mag- 
nitude variations that produce "anomalous" scatter in 
the Hubble diagram. The source of anomalous scatter 
is unknown, and it is not clear if the scatter can be 
reduced with an improved light curve model, or if this 
scatter is due to some random physical process such as 
brightness variations as a function of viewing angle in an 
asymmetric explosion. For simulations with 10 4 times 
the nominal exposure time and z < 0.5, i.e., for which 
photon noise is negligible, we define the anomalous scat- 
ter to be the rms (RMS^) of the difference between the 
fitted and generated distance modulus (/ifi t — Mgen) from 
the four-parameter light curve fit using spectroscopically 
determined redshifts. The default model, called "color- 
smearing," introduces an independent magnitude fluctu- 
ation in each passband, and the fluctuation is the same 
for all epochs within each passband. A random num- 
ber r f from a unit- variance Gaussian distribution is cho- 
sen for each passband /, and a magnitude fluctuation 
8m f = rf(j{ is added to the generated magnitude at all 
epochs. As described in Sj5j a scatter of <jf = 0.1 mag 
is needed in order for the simulated photo-z precision 
to match that of the data. The resulting Hubble scat- 
ter is consistent with that seen in analyses of spectro- 
scopically confirmed data samples; RMS^ ~ 0.16 for 
the SNLS (griz) simulations, and RMS ;t ~ 0.19 mag 
for the SDSS-II (gri) simulations. The simulated SNLS 
scatter is slightly smaller because this survey results in 
larger S/N values. The second model of intrinsic vari- 
ations is called "coherent luminosity smearing:" a co- 
herent random magnitude shift, drawn from a Gaussian 
with cr co h ~ 0.15 mag, is added to the model magnitude 
for all epochs and passbands. In the coherent smearing 
method the intrinsic model colors are not varied, and the 
resulting anomalous scatter is RMS M = cr co h- 

Although both models of intrinsic magnitude variation 
result in the expected scatter in the Hubble diagram, 
only the color-smearing model can generate the observed 
photo-z precision in the SDSS-II and SNLS data sam- 
ples ($5]). We have not investigated simulations in which 
both models of intrinsic variation contribute, nor have 
we investigated variations in the color parameter (Ry 
for mlcs2k2 or /3 for salth) that could also introduce 
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anomalous scatter. 

To simulate non-photometric conditions and varying 
time intervals between observations due to bad weather, 
actual observing conditions are used for an existing sur- 
vey, or an estimate of such conditions for a planned (fu- 
ture) survey. For each simulated observation, the noise 
is determined from the measured point spread function 
(PSF)H zero point, CCD gain, and sky background. 
Noise from the host-galaxy background is not included. 
The simulated flux in CCD counts is based on a mag- 
to-flux zero point and a random fluctuation drawn from 
the noise estimate. For the SDSS-II and SNLS surveys, 
a detailed treatment of the search efficiency, including 
spectroscopic selection effects, is described in §6.2 of K09. 

The quality of the simulation is illustrated in Fig- 
ures CD and M for the SDSS-II and SNLS surveys, re- 
spectively, using the SN selection requirements described 
in §S] The parameters from each sample have been de- 
termined using the conventional fitting method with a 
fixed spectroscopically determined redshift. Each fig- 
ure shows data-simulation comparisons for the distribu- 
tions of spectroscopic redshift, color parameter (Ay for 
MLCS2k2, c for SALTii), and shape-luminosity parameter 
(A for mlcs2k2, xi for SALTii). The color and shape 
parameters are defined in §3] There is good overall con- 
sistency between the measured and simulated distribu- 
tions. 
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Fig. 1. — For the SDSS-II SN Survey, comparison of data distri- 
butions (dots) with those from the simulation (histograms). Fitted 
parameters for MLCS2K2 (z,Ay,A) are on the left, and for SALTII 
(z,c,xi) on the right. Fits were done with the redshift fixed to 
the true redshift. The simulated histograms are scaled to have the 
same number of entries as the data. 



4. THE lcfit+z METHOD 

The general principle behind the SN la photo- z deter- 
mination is illustrated in Fig.[3]using colors at the epoch 



of peak brightness. Increasing the redshift causes uni- 
form reddening at all wavelengths, while intrinsic red- 
dening (or extinction) causes more reddening at bluer 
wavelengths. A time-dependent light curve model (e.g., 
mlcs2k2 or SALTii) is used to account for the known 
color dependence on the light curve shape (commonly 
known as the stretch) and to use all epochs in order 
to maximize the photostatics in the photo-z measure- 
ment. Since SNe become redder with increasing epoch, 
any error in the epoch of peak brightness results in the 
wrong template colors, and hence an increased error in 
the photo- z. 

For real observations, the ideal color-color bands in 
Fig. [3] are smeared by photon statistics and by intrinsic 
SN color variations that are not described by the light 
curve model. With sufficient smearing, the colors of a 
very reddened SN at z ~ 0.1 are degenerate with a blue 
SN at z ~ 0.3 (see the region enclosed by dotted oval in 
Fig. [3} and we indeed see this degeneracy in the SDSS-II 
photo-z measurements. From data-simulation compar- 
isons, we find that smearing from photon statistics does 
not fully describe the photo-z precision, and we therefore 
propose that the modeling of intrinsic color variations is 
not adequate. (§03). 

While host-galaxy photometric redshifts depend on the 
determination of the 4000 A break, a similar ~ 2800 A 
break in the SN spectrum has little impact on the photo- 
z measurement because this feature is either inaccessible 
at low redshifts, or it is poorly measured at higher red- 
shifts. In current SN la models, fluxes at these very blue 
wavelengths are either ignored, or they are heavily down- 
weighted relative to the optical bands. 

The basic idea of the lcfit+z method is to start 
with a light curve fit model that has four free param- 
eters when the redshift is fixed to an accurately mea- 
sured value, and simply float the redshift as a fifth fitted 
parameter. We refer to these methods as MLCS2k2+Z 
and SALTii+z, where MLCS2K2 and SALTii refer to the 
conventional models in which the redshift is fixed to a 
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Fig. 2. — Same as Figure [T] but for the SNLS survey, using the 
data sample from A06. 
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Fig. 3. — Observer-frame r — i vs. g — r at peak brightness for 
SNe la using the SALTII model. Each set of color bands corresponds 
to the indicated redshift (0.1, 0.2, 0.3), and the variation within 
each band corresponds to variations in the intrinsic SN color (i.e., 
the SALTII c parameter). The solid and open circles correspond to 
a high-stretch and low-stretch SN, respectively. The dotted oval 
shows a degenerate region discussed in the text. 

precisely measured value. Compared to the color-based 
photo- z method, advantages of the LCFiT+z method in- 
clude a natural framework for tracking correlations be- 
tween redshift and distance modulus ( H5.2[) , and using 
all of the light curve information (instead of just peak 
flux) so that in principle the intrinsic SN color varia- 
tions can be accounted for. The main advantage of the 
color-based method is that it works over a broader red- 
shift range, and there is no need to worry about which 
observer-frame filters map into a valid wavelength range 
in the rest frame. 

For mlcs2k2+z, the five fitted parameters (which 
we denote with the vector afg) are the time of max- 
imum brightness in the (rest frame) £>-band (to), the 
shape-luminosity parameter (A), the host-galaxy extinc- 
tion in the V band (Ay), the distance modulus (/x), 
and the redshift (z p h t)- We use a flat Ay prior and 
R v = A V /E(B - V) = 2.2. For SALTii+z the five pa- 
rameters are the time of maximum brightness in the rest 
frame B-band (to), the shape-luminosity parameter (xi), 
the B — V color (c), the flux normalization (xo), and the 
redshift (z pilo t)- The following light curve fit \ 2 is mini- 
mized using MiNunPI: 
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+ 21n(<7 i /<f i ) \ , (1) 



and the corresponding probability (e x / 2 ) is used to 
marginalize as described in Appendix [B] Here, i^ data 
is the SN flux of the ith observation, F; modcl (x5) is the 
predicted flux using the five model parameters (£5), and 
is the quadrature sum of the mea- 
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sured and model uncertainties, respectively. The index 
% runs over all epochs and filters. The second term in 

21 http: / /wwwasdoc .web . cern. ch/wwwasdoc/minuit/mininain.htinl 
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Fig. 4. — Model uncertainty vs. rest-frame epoch for the U 
(solid) and B (dashed) passbands. 

Eq. Q] accounts for model uncertainties that depend on 
the rest-frame passband and epoch, which in turn depend 
on the photo- z value. The model uncertainties are shown 
in Fig. @] for the rest-frame U and B passbands. The ref- 
erence uncertainty (ct^) from the first iteration of the fit is 
used in the denominator so that the second term is close 
to zero in the second iteration; although the oi do not 
affect the minimization, these terms reduce the change in 
the calculated fit probability. As explained below, the % 2 
is minimized twice in order to include the appropriate fil- 
ters and epochs. The minimized values and uncertainties 
are then used to estimate the integration ranges needed 
to obtain marginalized results. 

In this study, we use the mlcs2k2 and SALTII light 
curve fitters that have been implemented in the SNANA 
package. The main reasons for using the SNANA code are 
(1) exactly the same light curve model is guaranteed to 
be used in both the light curve fits and in the generated 
simulations, (2) there are several improvements to the 
MLCS2k2 light curve fitter as explained in K09, (3) the 
photo-z implementation is identical for both models, (4) 
the SNANA fitter is significantly faster than the original 
fitting software. To check the SALTII implementation in 
SNANA, we have repeated the light curve fits and cosmol- 
ogy analysis for the six sample combinations in K09 and 
find that the dark energy equation of state parameter w 
is always within a few hundredths of the value obtained 
with the original code; these discrepancies are well below 
the statistical uncertainties. 

Although it is straightforward to include the redshift 
as a free parameter in the light curve fit, there are sub- 
tle pre-fit issues related to the unknown redshift value: 
(1) SN selection criteria that depend on knowing T rest 
= 7obs/(l + ^)23 such as requiring measurements with 
a minimum and maximum T rcst value; (2) as noted 
above, determining which observer-frame filters (with 
mean wavelength Af ) have Af /(l + z p i lot ) within the valid 
wavelength range of the fitting model; (3) determining 
the valid rest-frame epoch range for the fitting model; 
(4) as Af/(1 + Zphot) maps into a different rest-frame 
Alter (for MLCS2k2) there is a discontinuous change in 
the model error, and therefore the x 2 is n °t a continu- 
ous function of z p hot; and (5) determining robust initial 
fit-parameter values. Our treatment of these issues is 
described in Appendix [Al 

We end this section with a discussion of the processing 
time. For the SDSS-II light curves, which have nearly 
50 measurements on average, all of the minimization fit- 

22 T r est and T orjs are the rest-frame and observer-frame times in 
days since peak brightness in the B band. 
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iterations take ~ 1 s per SN using minuit. The marginal- 
ization (Appendix [BJ takes close to half a minute per SN 
using an integration grid of 11 points per fit parameter, or 
a total of ll 5 integration cells. However, the integration 
ranges usually need adjustment after marginalizing, and 
therefore the marginalization typically runs twice, taking 
nearly a minute per SN. The processing time scales lin- 
early with the number of measurements and as the fifth 
power of the number of grid points per fit parameter. 
For the integration grid above, the Monte Carlo Markov 
Chain technique requires about the same amount of pro- 
cessing time. 

Using the SNANA implementation of MLCS2k2+z and 
SALTii+z, we find that minuit gives adequate minimized 
values, but that the uncertainties are not reliable because 
of subtle discontinuities in the \ 2 derivative with respect 
to the photo-z. We must therefore marginalize in order to 
get useful uncertainties and covariances. To illustrate the 
computational issue more clearly, consider an LSST sam- 
ple of 500,000 SNe la. The marginalization for all of the 
SNe in this sample requires a total of 1 CPU-year. A fac- 
tor of 100 is probably needed for code development and 
systematic studies and another factor of several for sim- 
ulation studies. The total computing needs are therefore 
a few CPU centuries with today's processors, assuming 
that 11 5 integration cells give sufficient accuracy. To use 
the minimization, which reduces the computing needs to 
a few CPU years, the light curve model-magnitudes and 
errors must be continuous functions of rcdshift, as well 
as their derivatives. 

5. RESULTS FOR SDSS-II AND SNLS 

Here we present results for the three-season SDSS-II 
data and the first-season SNLS data described in $21 and 
we compare with results from simulations of those same 
samples. There are two fit minimizations (Q to deter- 
mine the appropriate filters to include. The photo-z and 
distance-modulus results are taken to be the mean of 
their respective probability distribution functions (pdf) 
marginalized over the other fit parameters using a grid 
of ll 5 integration cells (Appendix IB"]) . The uncertainty is 
taken to be the rms of the pdf. A prior on the host-galaxy 
photo-z or SN color could potentially improve the pre- 
cision of the method and reduce the frequency of catas- 
trophic SN photo-z outliers; we have not used such priors 
here in order to better illustrate the performance of the 
LCFIT+Z method on its own. 

Following a commonly used practice in the literature, 
we characterize the precision of the lcfit+z photo-z pre- 
cision with the quantity 

A z = (Zphot — z spec)/(l + ^spec) , (2) 

and we use RMS a, to denote the root mean square of 
the distribution of A z . The SDSS-II results are shown in 
Figures [5] and [5] for the MLCS2k2+z and SALTii+Z meth- 
ods, respectively. There are fewer SNe in the sample fit- 
ted with SALTii because the more restrictive rest-frame 
model wavelength range (2900-7000 A) rejects i-band 
data for redshifts below about 0.1; without i-band, these 
low-redshift SNe fail the requirement of three observer- 
frame filters. The SNLS results are shown in Figures [7] 
and [51 Distributions from the data and simulated sam- 
ples are shown side-by-side in these Figures, illustrating 
the reliability of the simulations in predicting the dis- 



persions and bias. Each plot shows the number of SNe 
satisfying the selection criteria, RMS a. (overall and ver- 
sus Zgpec), and the rcdshift bias (overall and versus z spe c). 
The RMS A , values are ~ 0.04 for both the SDSS-II and 
SNLS samples. 

For the mlcs2k2+z method applied to the SDSS-II 
sample (Fig. [5]), the overall A z bias in the data is notably 
larger than in the simulation, and there is a redshift- 
dependent bias that is partially predicted by the sim- 
ulation. There are two contributions to the A 2 bias. 
First, there is a degeneracy between intrinsic redden- 
ing (due to extinction or color) and redshift. For red- 
shifts z spcc > 0.3, the best-fit photo-z is sometimes near 
Zphot ~ 0.15, and the best-fit color corresponds to a very 
red and intrinsically dim SN la. This degeneracy is sen- 
sitive to the S/N; the resulting bias is redshift dependent 
and is well modeled by the simulation. 

To potentially identify fits with a catastrophic photo- 
z error resulting from a strong color-redshift degener- 
acy, we have looked for a second maximum in the one- 
dimensional marginalized pdf. For MLCS2k2+Z, ~ 9% of 
the fits have a second maximum in both the photo-z and 
color pdf, and for SALTII+Z the corresponding fraction 
is ~ 4%. For both models, the photo-z precision is the 
same for the subset with a second maximum in the pdf, 
and therefore this approach cannot be used to identify 
photo-z outliers in the SDSS-II sample. We also find no 
correlation between the photo-z uncertainty and catas- 
trophic outliers. Clearly a reliable host-galaxy photo-z 
prior will help reduce catastrophic outliers, and this in- 
formation will be used in future analyses that include 
spectroscopically unconfirmed SNe la in the Hubble dia- 
gram. 

The second contribution to the A z bias is related to the 
£/-band anomaly discussed in K09, and this source of bias 
is not modeled in the simulation. The sub-sample with 
z > 0.2, where the observer frame g band corresponds to 
the rest-frame UV region, has a bias larger than the av- 
erage. The sub-sample with z < 0.2 has a much smaller 
bias. 

For cosmological applications, it is of interest to study 
how the use of photo-z's in place of spectroscopic red- 
shifts impacts the determination of SN distances. Dis- 
tance modulus (/i) dispersions for fits with both spec- 
troscopic and photometric redshifts are shown in Fig. [9j 
where RMS M is the rms scatter of the distribution of 
A*fit — Mref ■ The reference distance modulus (// re f) is cal- 
culated from the spectroscopic redshift and the same 
standard cosmology used in the simulation: w = — 1, 
Qyi = 0.3, Ha = 0.7. For mlcs2k2, /i fit is the fitted 
distance modulus. For SALTII, /igt is defined to be 

/i|f LT2 ~ 30 - 2.51og 10 (.T ) +a-xi-p-c, (3) 

where a = 0.11 and /3 = 2.6 are fixed parameters from 
the simulation. Note that the particular choice of a and 
ft does not affect the /x dispersion. Compared to the four- 
parameter light curve fits using spectroscopic redshifts, 
the lcfit+z method increases RMS M by 0.1 to 0.2 mag. 

To gauge the appropriate level of intrinsic magnitude 
variations in the simulation, we compare the A z pre- 
cision in the data to that in two different simulated 
samples. The first sample is generated with 0.1 mag 
color-smearing, and the second sample is generated with 
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0.2 mag coherent smearing. The width of a Gaussian fit 
to the A z distribution (uaJ is used instead of RMSa, to 
reduce sensitivity to outliers. The OA- results arc shown 
in Table Q] for the SDSS-II and SNLS data and for the 
simulated samples. Using coherent smearing, the simu- 
lated photo-z precision is significantly better than that 
of the data, while the color smearing model matches the 
data well. For the small SNLS data sample fitted with 
MLCS2k2+z, the large uncertainty on oa z is due to a 
statistical anomaly in the distribution that results in a 
poor fit to a Gaussian. 

This empirical estimate of random intrinsic color dis- 
persion needed in the simulation does not necessarily sug- 
gest that there are random color variations in SN light 
curves, but rather that there arc additional sources of 
color variation that are not captured by the light curve 
models. Using t he ne arby SN la sample (z < O.f), 
iNobili fc Goobarl (|2008| ) also found evidence for intrin- 
sic color dispersion. They fit the SNe with a light curve 
model that includes many more color parameters than 
MLCS2k2 or SALTii, and their estimate of the color dis- 
persion is considerably smaller than our empirical esti- 
mate based on matching the photo-z precision. 



Photo- 
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Precision <t a J3 for Data and Simulations 
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choice between priors and selection criteria is not ad- 
dressed here and will need further study. 

The PD09 sample is based on nearly 300 SNe la cor- 
responding to the first three seasons of SNLS, while we 
use the publicly available sample from A06. To make 
the selection criteria more similar for this comparison, 
we have relaxed our requirement on the maximum S/N: 
three filters must have at least one measurement each 
with S/N> 5 (instead of 8). Our modified selection re- 
sults in 55 SNe la: 13 with z < 0.45 and 42 with z > 0.45. 

To evaluate the photo-z precision, we use the PD09 
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b Uses color-smearing model with <Tf =0.1 mag. 
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Fig. 5. — Redshift precision for SDSS-II sample using 
MLCS2K2+Z. Left plots are for data; right plots for simulation. Top 
plots show the A z distribution for the entire sample; the number 
of events, rms, and bias are indicated on the plot. Middle plots 
show A z vs. Zspec- Bottom plots show the bias and RMSa z in 
redshift bins. 



5.1. Comparison with Recent SNLS Photo-z Results 

Using data and simulations for the SNLS, we compare 
our photo-z precision with recent results from PD09. 
They use the SALTII+Z method in a manner very sim- 
ilar to ours. The differences between our method and 
theirs are: (1) they use all four griz filters, while we use 
only those filters that correspond to the valid rest-frame 
wavelength range; (2) their initial parameter scan is in 
redshift only (Az = 0.1 bins), while our initial scan is 
over a two-dimensional grid of redshift (Az = 0.04 bins) 
and color (Ac = 0.2 bins); (3) they impose priors on the 
color and redshift, while we do not use priors; and (4) 
they impose less restrictive light curve selection require- 
ments, including the addition of photometrically iden- 
tified SNe la (i.e., spectroscopically unconfirmed SNe). 
The trade-off between the use of priors versus selection 
criteria mainly affects the rate of catastrophic photo-z 
outliers. Our choice of using flat priors is intended to 
better illustrate the performance of the saltii+z method 
on higher-quality light curves. While the relaxed cuts in 
PD09 have the advantage of increasing the sample size, 
using a prior on color (or on host-galaxy extinction) re- 
quires a detailed understanding of the underlying color 
distribution and survey selection function. The optimal 
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FlG. 6. — Same as Fig. [5] but using SALTII+Z. 
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metric ctaz/m+z) = 1-48 x mcdian|A 2 |, where A z is de- 
fined in Eq. [2j This quantity is much closer to the Gaus- 
sian sigma of A 2 than to RMSa,- To quantify the rate 
of catastrophic photo-z outliers, wc define r\ x as the frac- 
tion of SNe with a photo-z that satisfies |A Z | > x 1 and 
we follow PD09 in using x = 0.15. 

Table [2] compares our precision metrics with those in 
PD09. We also give the <Jaz/(i+z) breakdown for the low- 
rcdshift (z spC c < 0.45) and high-redshift (z spcc > 0.45) 
ranges. We caution that these comparisons are based on 
different data samples, different selection criteria, and 
different priors. For the data comparison, the two analy- 
ses are reasonably consistent in both paz/(i+z) an d ?7o.i5 
for both redshift ranges. For the simulation compar- 
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-9.4 ±7.4 I [ 



SNLS - MLCS 

N S |M = 224 
rms = 0.035 
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SIM 



ison there is a subtle disagreement. The PD09 sim- 
ulation, which uses coherent mag-smearing, underesti- 
mates the scatter in the low-redshift range but accurately 
predicts the precision in the high-redshift range. Our 
simulation using coherent mag-smearing underestimates 
°Az/(i+z) ( m PD09) for both redshift ranges, but our 
simulation based on color-smearing predicts <JAz/(i+z) 
fairly well, perhaps with a slight overestimate of the 
scatter. Our simulation supports our earlier conclu- 
sion that color-smearing is needed to model the photo-z 
precision; the PD09 simulation supports our conclusion 
in the low-redshift range but not in the high-redshift 
range. Finally, our simulation underestimates the frac- 
tion of catastrophic outliers (770.15 ~ 0.002), while the 
PD09 simulation gives good agreement with the data 
(770.15 ~ 0.01). 



TABLE 2 

photo-z Precision for SNLS Data and Simulation. Results 

FOR PD09 AND THIS WORK ARE SHOWN. OUR RESULTS INCLUDE 
THE UNCERTAINTY IN PARENTHESES. 
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Fig. 7. — Same as Fig. [5] but using the SNLS sample 
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Fig. 8. — Same as Fig. [5] but using the SNLS sample and 
SALTII+Z. 
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fraction of SNe with \z phot — z sp oc|/(l + z spcc ) > 0.15. 
columns in Table 1 of PD09. 



5.2. Photo-z Correlations 

Here we briefly discuss photo-z correlations that should 
be propagated in a Hubble diagram analysis. Using 
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Fig. 9. — RMS M for four-parameter MLCS2K2 and SALTII fits 
(bottom of each arrow) and for five-parameter MLCS2K2+Z and 
SALTII+Z fits (top of each arrow). The left panel shows RMS M for 
the data; the right panel shows the simulations. 
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the lcfit+z results from the SDSS-II sample, Fig. [TO] 
shows reduced correlations (p) between the photo- z and 
each of the other four light curve fit parameters. The 
MLCS2k2+z photo-z correlation with to and distance 
modulus (upper plots) are both peaked at large positive 
values, the correlation with extinction (Ay) is negative, 
and there is little correlation with the shape parame- 
ter A. For SALTii+z (lower plots) the photo-z correla- 
tions with to, shape/stretch parameter (x±), and color are 
qualitatively similar to those based on the MLCS2k2+Z 
method. The pi x o,z) correlation has a very broad dis- 
tribution with an average near zero. For mlcs2k2+z, 
the simulated distributions match the data well, while 
for SALTii+z there is a slight discrepancy in the distri- 
butions of P(c.z) and PxO,z- This discrepancy occurs only 
for z sp ec > 0.2 and could be an artifact of the subset with 
smaller S/N. 



MLCS2k2+Z Fit Correlations for SDSS 




Fig. 10. — Reduced correlations (p) between the photo- z (Z) and 
the other four light curve fit parameters as indicated in the title of 
each plot. Top plots are for the SDSS-II sample using MLCS2K2+Z; 
bottom plots are for the SALTII+Z method. The data are shown by 
dots; the simulation is shown by the histogram. 



6. PREDICTIONS FOR LSST 

Since we have demonstrated that the SNANA simulation 
can be used to reliably determine the photo- z precision 
for the SDSS-II and SNLS samples, we now turn our 
attention to forecasts for the LSST survey. LSST will 
discover far more SNe than spectroscopic resources can 
target, and photometric methods will be needed to deter- 
mine both the rcdshift and SN type for the majority of 
events. Here we determine the precision (bias and rms) 
on the photo-z and distance modulus using the lcfit+z 
method on more than 10 4 simulated SNe la (after se- 
lection requirements) corresponding to the LSST-DEEP 
and LSST-MAIN surveys. Contamination from non-la 
SNe and the resulting precision in cosmological parame- 
ters will be presented in a future work. 

The DEEP survey comprises seven fields, each cover- 
ing nearly 10 deg 2 , that is densely time-sampled in all of 
the ugrizY LSST filters. After rejecting passbands with 
invalid Af/(l + z p hot), the average number of observations 
per SN la used in the light curve fit is 66. The MAIN 
survey covers more than 20,000 deg 2 but is not opti- 
mized for SN observations, so the light curve sampling 



often has large temporal gaps for each filter. The mean 
number of fitted observations per SN for the MAIN sur- 
vey is 20, more than a factor of 3 fewer compared to the 
DEEP fields . Mor e details about the LSST are given i n 
llvezic et all (l2008h : ILSST Science Collaborations! (l200l . 
Due to the extensive computing resources needed to 
marginalize these large LSST samples, wc have only per- 
formed the minimizations that are adequate for deter- 
mining central values for the fitted parameters. 

To simulate observing conditions, we use the out- 
put of version OP SIM1.29 of the LSST Operations 
Cadence Simulator (iDeleado et aL (|2006| ) and §3.1 of 
ILSST Science Collaborations! (|2009l )). for the cadence, 
sky noise, and 5cr limiting magnitude for each measure- 
ment For each observation, the SNANA simulation requires 
a zero point (Z pe .) to translate the simulated SN mag- 
nitude (m) into an observed CCD flux measured in pho- 
toelectrons, F = lCT - 4 ( m - z p ■»■). In terms of the OP- 
SIM1.29 parameters, we calculate this zero point to be 

Zp.e. = 2M 5aps - M sky + 2.5 log 10 (i4 • (S/N) 2 ) 

+ 2.51og 10 U+A" 1 x up-«M* r -M* rp .)'] , (4) 

where M.§a VS is the 5<r limiting magnitude, M.sk y 
is the Perry sky brightness (mag/arcsec 2 ), A = 
[2tt J[PSF(r)] 2 rrir]- 1 = (1.51 • FWHM) 2 is the effective 
aperture (in arcsec 2 ) where FWHM describes the seeing, 
and S /N= 5 is the signal-to- noise ratio corresponding to 

For this study, we carry out the light curve fits both 
with and without a host-galaxy photo-z prior; no other 
priors are used. The host-galaxy prior is determined from 
the Bayesi an Photometr ic Redshift Estimation (BPZ) 
technique (jBenitez! 120001 ) applied to a preliminary set 
of simulated galaxies. The galaxy colors and lumi- 
nosities are generated to match observed distributions 
as a function of redshift. For more details, see §3.8 
of ILSST Science Collaborations! (|2009| ). The signal to 
noise as a function of apparent magnitude for the host- 
galaxy simulation is based on co-added exposures for 10 
years of the MAIN survey, and photo-z's arc determined 
for galaxies with r magnitudes down to 25. The av- 
erage host-galaxy photo-z precision from BPZ is 0.02, 
and there are some variations with redshift as shown in 
Fig. HU The z prio t — z gcn bias versus redshift has wiggles 
of order 0.005, but the true bias could be larger if the 
SN host galaxies are not a random subset of the galaxies 
used for photo-z training. Note that in this LSST dis- 
cussion of the host galaxy and SNe la, we characterize 
the photo-z precision in terms of z p hot — z gen instead of 
A,. 

These host-galaxy photo-z values are stored in a li- 
brary for the SNANA simulation. For each simulated SN 
with true redshift zsn, the host galaxy with true red- 
shift (zg a i) closest to zsn is selected. The corresponding 
host-galaxy photo-z is then scaled by the ratio zsN/z ga i 
to correct for the slight redshift mismatch between the 
SN and the host galaxy. The scaled host-galaxy photo-z 
and its uncertainty are used to impose a Gaussian prior 

in LCFIT+Z. 

To ensure well-sampled light curves for the lcfit+z 
method, we apply the following selection requirements to 
the simulated LSST SN data: (1) at least two filters with 
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a measurement satisfying T rcst < —5 days; (2) at least 
two filters with a measurement satisfying T rost > +20 
days; (3) largest rest-frame gap (that overlaps —5 to +20 
days) is < 15 days; (4) at least three observer-frame fil- 
ters have an epoch with S/N > 10; and (5) the light curve 
fit probability satisfies P x 2 > 0.02. The requirement of 
at least two filters (cuts (2) and (3)) removes poorly sam- 
pled light curves predominantly from the MAIN survey. 
Using these requirements, the number of SNe per year 
is 1900 and 5 x 10 4 for the DEEP and MAIN surveys, 
respectively. We note that these selection requirements 
are based on educated guesses rather than an optimiza- 
tion procedure. Example light curves from the DEEP 
and MAIN surveys are shown in Fig. Q21 

As a test of the fitting software, we first simulate ideal 
DEEP-ficld samples with no intrinsic mag-smearing and 
the exposure time artificially increased by a factor of 
10 4 compared to the nominal exposure. The resulting 
photo-z bias is less than 0.001 at all redshifts for both 
MLCS2k2+z and SALTII+Z, and the photo-z dispersion 
(rms) is less than 0.003. The bias on the distance modu- 
lus varies between and 0.01 mag for mlcs2k2+Z, and 
is less than ±0.005 mag for SALTii+z. The distance mod- 
ulus dispersion is ~ 0.02 mag for both fitting models. 

6.1. Results for LSST Simulations 

The SN photo-z residuals as a function of redshift are 
shown in Fig. [13] for a simulation without intrinsic mag- 
nitude fluctuations and for a simulation using the same 
intrinsic color fluctuations needed to match the photo-z 
precision for the SDSS-II and SNLS data samples ($5|). 
Each panel shows the residuals as a function of fitting 
method (mlcs2k2+z or SALTii+z), survey field (DEEP 
or MAIN) and redshift prior (flat or host-galaxy photo- 
z). With no intrinsic fluctuations, the most notable 
effects are: (1) with a flat redshift prior, the extreme 
photo-z outliers extend up to |z p hot — z gen| ~ 0.2, and 
(2) the host-galaxy photo-z prior significantly reduces 
the number of outliers. With default color fluctuations, 
there is a notable increase in the photo-z outliers. In both 
cases, the redshift range is higher for SALTii+z, because 
it extends to a lower rest-frame wavelength (2900 A) than 
MLCS2K2 (3200 A). 

0.08 t 1 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 
^gen 

Fig. 11.— For 6 X 10 4 simulated LSST host galaxies, BPZ bias 
(crosses) and rms (curve) of z p hot — ^gon vs. the true host-galaxy 
redshift (z gen ) in redshift bins of width Az = 0.03. 



We quantify the rate of catastrophic outliers using 770.15 
and 770.10 (see ^5.1|) . With ~ 10 4 SNe per sample, the 
approximate uncertainty is a v ~ y/77/lOO. Without in- 
trinsic mag-smearing (Fig. U3U eft). 770.15 = in all cases. 
In this case, for the MAIN survey, 770.10 ~ 0.01 with- 
out a host-galaxy photo-z prior, and 770.10 is xlO smaller 
when the host-galaxy photo-z prior is used. Using the 
default color-smearing model in the simulation (Fig. 1131 
right), and fitting without a host-galaxy photo-z prior, 
770.15 < 0.001 for the DEEP survey, and it is somewhat 
larger for the MAIN survey: 0.004 and 0.012 using the 
MLCS2k2+z and SALTii+z methods, respectively. For 
770.10, the corresponding fractions are x4 larger. Using 
the SALTii+Z method, the number of outliers is about x3 
larger compared to MLCS2K2+Z; this difference could be 
related to the model, but it could also be an artifact of 
our implementation. We therefore make no claims that 
either method is more precise or has fewer catastrophic 
outliers. When the host-galaxy photo-z prior is used, 
770.10 = V0.15 = in all cases. 

To quantify the precision of the lcfit+z method, we 
have evaluated the bias and rms spread as a function of 
redshift for both the photo-z and distance modulus (/j,). 
Figure [14] shows the results based on simulations using 
the coherent mag-smearing model, and Fig.lLolshows the 
results using the default color-smearing model. The main 
differences between using these two models of intrinsic 
variations are: (1) the photo-z rms goes to nearly zero 
at low redshift for the coherent mag-smearing model, but 
has a floor of about 0.01 for the color-smearing model, 
and (2) the rms is slightly larger at high redshift for the 
color-smearing model. 

Here we briefly summarize the precision based on sim- 
ulations using the default color-smearing model and fit- 
ting without a host-galaxy photo-z prior (see "FLATZ" 
panels in Fig. [T5| . In the DEEP survey, the photo-z rms 
precision is ~ 0.01 at low redshifts and rises to about 0.04 
at the highest redshifts. The RMS M precision is near the 
0.15 mag floor at low redshifts and roughly doubles at 
the highest redshifts. In the MAIN survey, the photo- 
z precision is about x2 worse compared to the DEEP 
field survey. The corresponding RMS M precision is about 
0.2 mag at the lowest redshifts and also roughly doubles 
at the highest redshift. The photo-z bias in the DEEP 
survey has wiggles of amplitude ~ 0.01 as a function of 
redshift. The ^-bias wiggles are at most at the 0.01 mag 
level, and are notably less apparent than those seen in 
the photo-z bias. In the MAIN survey, the bias is no- 
ticeably larger and rcdshift-dcpcndcnt; the photo-z bias 
reaches 0.02 and the 77-bias reaches 0.1 mag. Both the 
photo-z and y, biases are largest at z gen ~ 0.6. 

When fitting with a host-galaxy photo-z prior (see 
"HOSTZ" panels in Fig. fTS"]) , the precision is significantly 
improved for both the photo-z and fi. In the MAIN 
survey, however, a redshift-dependent bias remains for 
Zg Cn > 0.6. 

Although the LSST photo-z precision looks promising, 
we urge some caution in the interpretation of these simu- 
lations. If we consider the LSST DEEP-ficld subset over 
the same redshift range as the SDSS-II sample (z < 0.4), 
the forecast LSST photo-z precision (rms) is about x3 
better than that for the SDSS-II data sample described 
in Sj5] reflecting the higher expected signal to noise and 
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Fig. 12. — Left two panels show typical SN la light curves simulated for the LSST-DEEP fields; right two panels show typical light 
curves for the LSST-MAIN fields. The redshift is indicated on the top of each plot. Dots are simulated fluxes, the solid curve is the best-fit 
MLCS2K2+Z model, and the dashed curves are the ±lcr error bands for the model. 



broader wavelength coverage. Although we arc confident 
in extrapolating the rms precision based on the treat- 
ment of photon statistics and color smearing, we cannot 
rule out unknown systematic effects, primarily from the 
unknown source of intrinsic brightness variations, that 
could limit the photo-z precision and accuracy. Con- 
cerning the photo- z bias presented here, this should be 
considered a lower limit because the same light curve 
model has been used in both the simulation and in the 
lcfit+z fit. The current MLCS2K2 and SALTii models 
have been shown to differ significantly in the ultraviolet 
region (K09), but such modeling errors have not been 
considered here. In future studies, fitting SALTII simu- 
lations with MLCS2K2 (and vice-versa) would likely give 
an upper limit on the photo- z bias, and may lead to ad- 
ditional clues about problems in the light curve models. 

7. COMPARISONS WITH COLOR-BASED REDSHIFT 
ESTIMATES 

Here we compare our lcfit+Z results with color-based 
SN redshift estimates for the SNLS sample and for an 
MLCS2K2-based simulation. 

7.1. Comparison with the SNLS Sample 

iWand (|2007|) determined color-based photometric red- 
shifts for 40 SNe la from the SNLS A06 sample. For the 
20 SNe used in the training, the author finds RMSa, = 
0.03; for the remaining 20 SNe, RMS A , = 0.05. * In 
our mlcs2k2+z analysis, 37 SNe satisfy the selec- 
tion criteria, and RMSa z = 0.045. For the SALTii+z 
method, 37 SNe satisfy the selection requirements, and 
RMSa z = 0.040. By this metric, our lcfit+z method 
works equally well compar ed to the colo r-based method. 
Since a list of SNe used bv iWanj (|2007t ) is not available, 
we cannot make a more detailed comparison with the 
same subset. 

7.2. Comparison with MLCS2K2-based Simulations 

We compare photo- z re sults of the two m ethods on 
simulations as described in I Wang et al.l (|2007P ) (hereafter 



WNW07) . Following the procedure in WNW07, we simu- 
late observer-frame filters riz using the MLCS2k2 model 
for redshifts z < 0.95 (so that r-band data are always 
well defined within the model), fix the shape-luminosity 
parameter A = 0, generate a fiat redshift distribution, 
and ignore intrinsic magnitude variations that introduce 
anomalous Hubble scatter. For each SN and each filter, 
the exposure time is adjusted so that S/N = 25 at the 
epoch of peak brightness. In WNW07, only the peak 
fluxes are used and therefore the light curve sampling 
does not matter; to investigate the comparative effective- 
ness of our mlcs2k2+Z method, simulated light curves 
are sampled every 7 days in the observer frame. 

In the ideal case of no host-galaxy extinction (Ay = 0), 
WNW07 find RMS A , = 0.004 with a mean bias of 
5.4 x 10~ 4 ; using MLCS2k2+z under similar assump- 
tions, we find RMSa^ = 0.006 and a mean bias of 
(— 1.7 ± 2.1) x 10 ~ 4 . Including host-galaxy extinction in 
the simulation with a pdf P(Ay) = exp(— Ay/0AQ) and 
reddening parameter R v = 3.1, WNW07 find RMS a, = 
0.044 with a mean bias of 0.008; using MLCS2k2+z, un- 
der the same conditions wc find RMSa^ = 0.009 and a 
mean bias of (0.5 ± 2.9) x 10~ 4 . The significantly im- 
proved precision with our MLCS2k2+z method in these 
more realistic conditions is in part due to the increase in 
effective signal to noise that comes from using the entire 
light curve. In addition, shape and color information con- 
tained in the light curve enables the MLCS2k2+z method 
to partially untangle color variations from extinction ver- 
sus those produced by redshift. 

Since the color-based method in WNV07 cannot un- 
tangle reddening from extinction and redshift, we have 
attempted a more fair comparison to the MLCS2k2+Z 
method in which Ay is fixed to the mean generated ex- 
tinction of 0.46 mag; in this case RMSa. nearly doubles 
to 0.015 for the MLCS2k2+Z method, yet is nearly a fac- 
tor of 3 smaller than the dispersion in WNV07. This 
difference is either due to the enhanced photon statis- 
tics from using the entire light curve in the MLCS2k2+Z 
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Fig. 13. — Zphot — ^gcn residuals vs. z gen for the model (MLCS2K2or SALTll), survey (DEEP or MAIN), and photo-z prior (host galaxy 
or flat) indicated in each panel for LSST simulations. Each pair of plots compares residuals with no photo-z prior (FLATZ) to residuals 
with host-galaxy photo-z prior (HOSTZ). The fraction of catastrophic outliers (»?o.iO , 0.15)1 indicated in each panel, is denned in the text. 
The simulations are generated without intrinsic mag-smearing (left plots) and with the default color-smearing model (right plots). 
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Fig. 14. — Bias (crosses) and rms (curve) on photo-z residual (.Zphcrt — ^gen), and on distance modulus residual (fiat — Mgcn) vs. the 
true redshift (z gcn ) in z-bins of width AZ = 0.03 for the LSST simulations. Each plot indicates DEEP or MAIN fields, and FLATZ (SN 
only) or HOSTZ (host-galaxy photo-z prior). The simulation uses coherent (0.15 mag) brightness fluctuations in all filters and is fitted 
with MLCS2K2+Z (left) and SALTII+Z (right). For z < 0.15, the SALTll curves drop to zero because there are only two valid filters (gr), 
and the data fail the three-filter requirement. The dashed horizontal lines in the /ifj t — figen plots show the rms contribution from intrinsic 
variations. 



method or from non-optimal training in the color-based 
method. To increase RMS A , to the WNV07 value of 
0.044, the peak S/N must be reduced from 25 to less 
than 10. 

In the mlcs2k2+z fits we have used Ry = 3.1, the 
same value used in generating the simulation. Using the 
correct value for Ry is an apparently unfair advantage 
over the WNW07 treatment, in which no assumptions 
are made about color variations. However, in practice the 
assumption about the value of Ry makes little difference 
to the results: fitting with Ry = 2.2 produces the same 
precision for A z , although the resulting distance moduli 
are biased by about 0.1 mag. 

Even though the effects of host-galaxy extinction are 
included the simulation described in WNW07, this sim- 
ulation is not realistic because the shape-luminosity pa- 
rameter A is fixed to zero, and there are no intrinsic 



magnitude variations. Simulating our best estimate for 
these effects (Q, the bias and scatter in the fitted photo- 
Z and distance modulus are shown as a function of z S pec 
in Fig. [I6j For redshifts below 0.8 there is no signifi- 
cant bias in either the photo-z or distance modulus. For 
z > 0.8 the bias and scatter increase significantly. The 
nature of this high-redshift bias is illustrated in Fig [T7l 
which shows the correlation between the fitted versus 
simulated photo-z difference and the fitted versus sim- 
ulated difference for time of peak (to), extinction, and 
shape-luminosity parameter. The photo-z outliers are 
clearly correlated with to outliers. Since the SN la color 
becomes redder with epoch, a misestimate of to changes 
the apparent SN color, which is translated into an error 
in the redshift. The bias in to is an artifact of the discrete 
mlcs2k2 passbands used to characterize the rest-frame 
light curves. For SNe simulated at redshifts z > 0.8, the 



13 




three observer-frame filters map into only two rest-frame 
filters: riz — > UBB. In a small fraction of the light curve 
fits, however, the wrong filter-mapping (riz — > UBV) re- 
sults in a smaller x 2 . Since the B and U-band templates 
have different shapes, as well as a 2-day shift in the time 
of peak brightness, the fitted in is biased. 
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Fig. 16. — For MLCS2K2 simulations with peak S/N = 25, fitted 
photo-z bias and RMSa, vs. 2 spcc (left), and distance modulus 
bias and RMS^ vs. z S pcc (right). The simulations are generated 
according to the prescription of WNW07, but also including non- 
zero A and intrinsic color variations. 



8. CONCLUSIONS 

We have developed and tested a photometric redshift 
estimation method using SN la light curves within the 
framework of the MLCS2k2 and SALTii models and find 
that they result in similar photo-z precision. We used 
an iterative fitting procedure to determine the valid 
observer-frame filters to use in the fits. Applying this 
method to SDSS-II and SNLS data, we obtained an av- 
erage photo-z precision of RMSa 2 ~ 0.04. To repro- 
duce a comparable level of precision in simulations, in- 
trinsic color-smearing is needed ($3J) at the level of about 
0.1 mag per passband or 0.14 mag per color. This em- 
pirical estimate of color smearing is consistent with but 



does not necessarily imply that there are random color 
variations in SN la light curves. However, this effect does 
indicate that there are additional sources of color varia- 
tion that are not captured by the MLCS2k2 and SALTii 
light curve models. 

We applied the lcfit+Z method to simulated LSST 
samples (fj6]). For the DEEP fields, the rms scatter of 
Zphot — z gcn varies from 0.01 to 0.04 without using a host- 
galaxy photo-z prior. For the MAIN survey the photo-z 
precision is about x2 worse. Using a host-galaxy photo- 
z prior significantly reduces outliers and improves the 
overall precision. The next critical step is to apply this 
method to simulations that include non-la type SNe and 
estimate the resulting contamination of photometric SN 
la samples by core-collapse SNe. 
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APPENDIX 

A. FITTING ISSUES WITH UNKNOWN REDSHIFT 

Here we discuss our implementation of the five pre- fit issues mentioned in SQ] (1) SN selection criteria that depend 
on knowing T rcst = T b s /(1 + z) such as requiring measurements with a minimum and maximum T rcst value; (2) 
determining which observer-frame filters (with mean wavelength Af) have Af/(1 + z p hot) within the valid wavelength 
range of the fitting model; (3) determining the valid rest-frame epoch range for the fitting model; (4) as Af/(1 + z p hot) 
maps into a different rest-frame filter (for mlcs2k2) there is a discontinuous change in the model error, and therefore 
the x 2 is n °t a continuous function of z p hot; and (5) determining robust initial fit parameter values. 

The simplest way to handle SN selection criteria (issue 1) is to postpone such requirements until the fit has finished 
and then use the fitted photo-z to determine the r rGS t values. This solution is not practical, however, because we often 
wish to remove poorly sampled light curves before fitting, thereby avoiding pathological fits that are of no interest. A 
safe way to apply T rcst -dcpcndent requirements before fitting is to relax such cuts by a factor of 1 + Z max , where Z max 
is a safe upper bound on all redshifts. For example, consider the requirements of a measurement with T rcst < —6 days 
and 21 < T rost < 60 days. Using Z max = 0.5 for the SDSS-II, the pre-fitted requirements are T rost < —4 days and 
14 < T res t < 90 days. Any initial redshift value can be used to determine T rest as long as it is less than Z max . Although 
these T rest -related requirements are relaxed, they are still useful for rejecting poorly sampled light curves; the nominal 
cut is applied after the fit using the fitted photo- z value. 

To determine the valid observer- frame filters (issue 2), the first- iteration photo- z fit uses all filters, with a possible 
exception for those covering the ultraviolet region with a mean wavelength below about 4000 A. The ultraviolet filter 
should be left out of the first iteration if it is rarely used, noting that it can be added back for the second fit iteration. 
The fundamental assumption about the fitting model is that extrapolating beyond the defined wavelength range gives 
reasonable magnitude estimates so that the fit converges and that the photo-z bias is not too large. The (biased) 
photo-z estimate is then used to determine which observer-frame filters to retain and to reject for the second fit 
iteration. Note that if an ultraviolet filter is excluded initially, then a low photo-z value will result in the inclusion of 
this filter for the second iteration. If a filter is excluded, the T rost -related requirements are re-tested; the fit stops if 
the light curve no longer has adequate sampling. 

Since the first-iteration photo-z (Zp hot ) is biased and has some uncertainty, a safety margin is used to determine 

which observer- frame filters to keep. For each observer- frame filter (f) with mean wavelength Af, the valid redshift 
range for this filter is defined by 

Z m in,{ = ^fMmax 6 ' — 1 z maxi = Af/A^ 6 — 1 , (Al) 

where A™^^^ are the minimum and maximum rest-frame wavelengths defined by the model. For example, the SALTII 
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model is defined for rest-frame wavelengths 2900-7000 A; the observer-frame i-band (A-s = 7500 A) is therefore valid 
for redshifts above z m i n .i = 0.071, and g-band (X g = 4720 A) is valid for redshifts below z max>g = 0.63. For this 
analysis, a filter is kept in the second iteration if the first-iteration fitted photo-z satisfies 

z min,f + A Z phot < Zphot < z max,f — ^ Z phot ■ (A2) 

Wc have set Az$ ot = 0.04 and 0.03 for the SNLS and SDSS surveys, respectively. Azffl ot is smaller for the SDSS-II 
because the S/N for SNc at z milLti = 0.07 is larger than the S/N for SNLS SNc at z maXl9 = 0.063. The Azffl ot filter- 
selection cut has not been optimized; the optimal cut is likely to depend on the uncertainty of the host-galaxy photo-z 
and may also depend on redshift. 

This filter selection algorithm is illustrated in Fig. [T8J The solid histograms in the top two plots (SDSS data and 
simulation) show the true redshift (z spe c) distributions near z m i n ^ when the i-band is kept in the SALTii+z fit. When 
the i-band is discarded (dashed), the entire light curve is discarded because of the requirement of having three filters. 
The gap between z minii = 0.071 and the bulk distribution is due to the Azp"* t cut. The bottom two plots in Fig. IT51 
illustrate cases for LSST when a dropped filter does not result in rejecting the entire light curve. When the LSST 
g-band is included in the MLCS2K2+Z fit (solid curve, lower left), the true redshift almost always satisfies the model- 
validity requirement (z spcc < 0.5). The dashed curve shows that the g-band has been correctly excluded when the 
true redshift is above 0.5, but we have also excluded this passband in some cases where it is valid (i.e., z spcc < 0.5). 
The situation is similar for the LSST Y"-band (lower right plot). For both passbands, a small number of light curve 
fits include these bands when they should have been excluded; potential biases from these invalid passbands should 
be accounted for in the assessment of systematic uncertainties on cosmological parameters. 

The selection of rest-frame epochs (issue 3), which depends on T b s /(1 + %>hot), is made in the same way as the 
selection of filters (issue 2). All measurements are included in the first fit iteration, and the initial photo-z estimate is 
then used to select which epochs satisfy the epoch range of the model. 

The x 2 continuity (issue 4) is an issue because the SNANA minimization is based on MINUIT, and this program relies on 
computing local derivatives with respect to the fitting parameters. This means that discontinuities in the \ 2 function 
and its derivative can lead to fitting failures or pathological results. The problem can be solved by marginalizing, 
but the necessary five-dimensional integration requires ~ 100 times more CPU processing. In addition to the utility 
of using the much faster minimization program, it is useful to identify \ 2 discontinuities because such functional 
pathologies usually correspond to unphysical behavior in the light curve model, and the model should be improved 
accordingly. 

There are two main sources of % 2 discontinuity: (1) interpolating lookup tables, and (2) filter-dependent model 
parameters. For class (1), appropriate numerical methods must be used to ensure continuity in both the function and 
its derivative. For class (2), the problem occurs for rest-frame models such as MLCS2K2, in which an infinitesimal 
change in z p hot results in a different rest-frame filter for the model or a different color to use for spectral warping in the 
K -corrections. This step-function change in the model parameters results in a discontinuity in the model magnitude 
(or its error) as a function of z p hot- To prevent such sharp discontinuities, SNANA uses a smooth transition function 
(A + Bta,n^ 1 (X/T\)) to smoothly vary model parameters between neighboring filters. 

Since the redshift and SN color are somewhat degenerate, initial parameter estimation (issue 5) is important so 
that MINUIT will find a global, rather than local, minimum. The initial color and photo-z are determined from a 
crude x 2 minimization on a coarse grid: the color is varied in bins of 0.2 and the photo-z is varied in bins of 0.04. 
For each photo-z value, an initial distance modulus estimate (/^mi) is needed to ensure robust convergence of MINUIT. 
Although ^i n i is calculated from a standard ACDM cosmology using a specific set of cosmological parameters, the 
fitted distance modulus is unconstrained and is therefore not biased by the /imi calculation. For SALTii, the initial xq 
value is estimated by inverting Eq. [3] The shape- luminosity parameter is initialized to an average value: A = —0.1 for 
MLCS2k2, and x\ = 0.0 for SALTii. After the first fit iteration, the fitted parameters are used as initial values for the 
second fit iteration. When a filter is dropped and the first fit iteration is repeated, the fitted parameters are used as 
initial values, even though an invalid filter was used in the fit. If we do not use a coarse grid to initialize the color and 
photo-z, and simply start with average values, the photo-z precision for the SDSS-II sample is only slightly degraded. 
For the SNLS sample, however, which has a much larger redshift range compared to the SDSS-II, the photo-z precision 
is degraded by a factor of 2 due to a significant number of catastrophic outliers. 

B. MARGIN ALIZATION 

Here we describe some of the details related to the marginalization. The minimized values and uncertainties are used 
to estimate the integration ranges: ±4cr around the minimized value for each parameter. The integration grid consists 
of iV gr id points per fit-parameter or a total of -/V gr id 5 integration cells. We find that iVgrid = 11 is a good compromise 
between precision and computing time. To improve calculation speed per integration cell, the \ 2 calculation stops 
when the probability falls below 10 -5 . 

The marginalization is repeated for either of the following cases: (1) the probability at any integration boundary 
is greater than 0.03, or (2) more than three one-dimensional bins (i.e., marginalized over the other four parameters) 
have a probability less than 10~ 4 . In the first case the integration range is extended, while in the second case the 
integration range is reduced. 
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Fig. 18. — Illustration of filters excluded after the first fit iteration. Each plot shows the spectroscopic (i.e., true) redshift distribution 
when the indicated filter is kept in the photo-z fit (solid) and when the same filter has been dropped after the first fit iteration (dashed). 
The vertical arrow shows either z m i n f or z max f for the filter indicated. z Bpec is used only in making these plots, and is not used in the 
photo-z fits. 
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